tRLD INTELLECTUAL PROPERTY ORGAN IZATIQ 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 5/10, 15/09, 15/12, 15/62, 15/63, 
15/79, C07K 19/00, 14/705 



Al 



(11) International Publication Number: WO 95/34641 

(43) International Publication Date: 21 December 1995 (21.12.95) 



(21) International Application Number: PCT/US95/07542 

(22) International Filing Date: 13 June 1995 (13.06.95) 



(30) Priority Data: 

08/260,514 



15 June 1994 (15.06.94) 



US 



(71) Applicant: THE SCRIPPS RESEARCH INSTITUTE [US/US]; 

10666 North Torrey Pines Road, TPC-8, La Jolla, CA 92037 
(US). 

(72) Inventors: MUIR, Thomas, W.; 3745 Ingraham Street, San 

Diego, CA 92109 (US). WILLIAMS, Michael, J.; 4029 
Carmel View Road, #149, San Diego, CA 92130 (US). 
GINSBERG, Mark, H.; 2944 Renault Place, San Diego, 
CA 92122 (US). KENT, Stephen, B.H.; 6046 La Jolla 
Hermosa Avenue, La Jolla, CA 92037 (US). CHEN, Yi- 
Ping; 8155 Cargill Avenue, #40, San Diego, CA 92122 
(US). O'TOOLE, Timothy, E.; 4600 Lamont #4-326, San 
Diego, CA 92109 (US). 

(74) Agents: LOGAN, April, C. et al.; The Scripps Research 
Institute, 10666 North Torrey Pines Road, TPC 8, La Jolla, 
CA 92037 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, DE, DK, ES, FR, GB, GR, EE, IT, LU, MC, NL, PT, 
SE). 



Published 

With international search report 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: STRUCTURAL MODELS FOR CYTOPLASMIC DOMAINS OF TRANSMEMBRANE RECEPTORS 
(57) Abstract 

The present invention comprises model systems that allow study and control of the activity of the intracellular or cytoplasmic regions 
of integrins and other transmembrane proteins. One aspect of these model systems is protein constructs that contain helical regions in order 
to approximate the effect of the transmembrane hydrophobic regions while maintaining solubility. One type of protein construct according 
to the present invention comprises: (1) a first segment including two amino acid sequences joined at their amino-terminal residues in 
head-to-head fashion, each of the two amino acid sequences of the first segment including a substantially helical amphiphilic amino acid 
region; and (2) two second segments, each of the second segments joined to the first segment at the carboxyl terminus of each of the two 
amino acid sequences of the first segment. The protein construct has one free alpha-amino terminus derived from one of the amino acid 
sequences of the first segment and two free carboxyl termini. Another type of protein construct according to the present invention comprises 
a first segment, two second segments, and two third segments. Also within the scope of the invention are chimeric integrins and methods 
for their use. 
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STRUCTURAL MODELS FOR CYTOPLASMIC DOMAINS 
OF TRANSMEMBRANE RECEPTORS 

BACKGROUND OF THE INVENTION 

This invention is directed to structural models for 
cytoplasmic domains of transmembrane receptors, including 
protein constructs and chimeric proteins, as well as 
methods of synthesis and use of such structural models. 

In eukaryotic cells, many proteins extend through 
the membrane and therefore have a cytoplasmic domain, a 
transmembrane domain, and an extracellular domain. Many 
of these proteins are involved in signal transduction, 
the control of cell adhesion, and cell-cell interaction 
control . 

Among the proteins that fall into this category are 
the integrins (R.O. Hynes, "Integrins: Versatility, 
Modulation, and Signalling in Cell Adhesion," Cell 69:11- 
25 (1992); S.M. Albelda & C.A. Buck, "Integrins and Other 
Cell Adhesion Molecules," FASEB J. 4: 2868-2880 (1990); 
E. Ruoslahti, "Integrins," J. Clin. Invest. 87: 1-5 
(1991) ; M.E. Hemler, "VLA Proteins in the Integrin 
Family: Structures, Functions, and Their Role on 
Leukocytes," Annu . Rev . I mmuno 1 . 8:365-400 (1990)). 

Integrins are involved in a number of pathological 
and physiological processes, including thrombosis, 
inflammation, and cancer (M.J. Humphries et al., "A 
Synthetic Peptide From Fibronectin Inhibits Experimental 
Metastasis of Murine Melanoma Cells," Science 233:467-470 
(1986); E. Ruoslahti, supra: 1-M.E. Hemler (1990), 
supra.) . Other physiological and pathological conditions 
involving changes in cell adhesiveness are also mediated 
through integrins . 

Typically, transmembrane proteins are heterodimeric , 
being noncovalent associations of two or more different 
types of polypeptide subunits. In particular, integrins 
are heterodimers of two different protein subunits, 
designated a and 0. The or subunits vary in size between 
120 and 180 kDa and are each noncovalently associated 
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with a jS subunit. The extracellular domain of the 
integrin molecule forms a ligand binding site; both the 
a and /S subunits are involved in forming the ligand 
binding site. A number of different ligands for 
5 integrins are known are known, including collagens, 

laminin, fibronectin, vitronectin, complement components, 
thrombospondin, and integral membrane proteins of the 
immunoglobulin superfamily such as ICAM-1, ICAM-2, and 
VCAM-1. The integrins recognize various short peptide 

10 sequences in the ligands. Examples of these are Arg-Gly- 

Asp (RGD) , Lys-Gln-Ala-Gly-Asp-Val (KQAGDV) (SEQ ID NO: 
1), Asp-Gly-Glu-Ala (DGEA) (SEQ ID NO: 2), Glu-Ile-Leu- 
Asp-Val (EILDV) (SEQ ID NO: 3), and Gly-Pro-Arg-Pro 
(GPRP) (SEQ ID NO: 4) . Variations in integrin function 

15 are often caused by changes in the ligand binding 

affinity of the extracellular domain of the integrins 
(J.S. Bennett & G. Vilaire, J. Clin. Invest. 64: 1393- 
1401 (1979); D.C. Altieri et al . , J. Cell Biol. 107: 
1893-1900 (1988); R.J. Faull et al . , J. Cell Biol. 121: 

20 155-162 (1993); B.A. Lollo et al . , J. Biol. Chem. 

268 : 216 93 -21700 (1993) ) . 

Integrin ot llb (3 3 (platelet GPIIb-IIIa) , a heterodimer of 
two type I transmembrane protein subunits (R.O. Hynes 
(1992) , supra) , manifests highly regulated changes in 

25 ligand binding affinity. Affinity state-specific 

antibodies, e.g., PAC1 (S.J. Shattil et al . , J. Biol. 
Chem. 260:1107-1114 (1985)), are useful for analysis of 
recombinant a llb (3 3 in heterologous cells (T.E. O' Toole et 
al., Cell Regulation 1:883-893 (1990)). Platelet 

30 agonists increase the affinity of ar IIb j3 3 (activation) 

probably by causing changes in the confirmation of the 
extracellular domain (T.E. O'Toole et al . (1990), supra; 
P.J. Sims et al . , J. Biol. Chem. 266:7345-7352 (1991)). 
Cytoplasmic signaling pathways involving heterotrimeric 

35 GTP binding proteins, phospholipid metabolism, and 

serine-threonine kinases initiate these conf irmational 
changes in the extracellular domain; these changes may 
also involve calcium fluxes, tyrosine kinases, and low 
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molecular weight GTP binding proteins (P.J. Sims et al . 
(1991), supra; S.J. Shattil et al . , J. Biol. Chem. 267: 
18424-18431 (1992); S.J. Shattil & J.S. Brugge, Curr . 
Qpin. Cell Biol . 3 : 869-879 (1991); M.H. Ginsberg et al ., 
Cold Spring Harbor Symposium of Quantitative Biology: 
The Cell Surface 57: 221-231 (1992); M.H. Ginsberg et 
al . , "Inside-Out Integrin Signalling," Curr. Qpin. Cell 
Biol. 4: 766-771 (1992); Y. Nemoto et al . , J. Biol. Chem. 
267: 20916-20920 (1992)) . How cytoplasmic signals result 
in changes in the conformation and ligand binding 
affinity of the extracellular domain ("inside-out signal 
transduction") of the integrin remains obscure. Studies 
with chimeras containing the cytoplasmic domains of 
various a and (3 subunits joined to the transmembrane and 
extracellular domain of or nb /3 3 indicates that integrin 
cytoplasmic domains transduce cell type-specific signals 
that modulate ligand binding affinity. These signals 
require active cellular processes in both a and (3 
cytoplasmic tails of the integrin, suggesting that they 
reflect physiologically relevant signals. In addition, 
deletion of a highly conserved motif, Gly-Phe-Phe-Lys-Arg 
(GFFKR) (SEQ ID NO: 5) , at the amino- terminus of the a 
subunit cytoplasmic domain, also resulted in high 
affinity binding of ligands to integrin af n /? 3 . In contrast 
to the chimeras, high affinity ligand binding to GFFKR 
(SEQ ID NO: 5) deletion mutants was independent of 
cellular metabolism, cell type, and the bulk of the /3 
subunit cytoplasmic domain. Thus, integrin cytoplasmic 
tails are targets for the modulation of integrin 
affinity. 

Although signal transduction by integrins, as well 
as by transmembrane proteins generally, is of great 
interest, technical difficulties have greatly limited the 
application of high resolution techniques for the 
structures of these proteins. Indeed, molecular 

structures are available for only two intact 
transmembrane proteins, a bacterial photoreaction center 
(J. Deisenhofer et al . , Nature 318: 618-624 (1985)), and 
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a porin (M.S. Weiss et al . , FEBS Lett. 267: 268-272 
(1990) ) . Structures of receptor extracellular domains 
have been determined using soluble truncated 
extracellular domains as models (A.M. DeVos et al . , 
Science 255: 306-312 (1992); M.V. Milburn et al., Science 
254: 1342-1347 (1991)). These structures have 

contributed to the understanding of the basis of ligand 
recognition, but have provided less insight into the 
mechanism of signal transduction. Many membrane proteins 
that transduce signals are members of the Type I 
transmembrane protein family, the defining feature of 
which is a single membrane spanning region. These 
include the T cell receptor (A.. Weiss, Cell 73: 209-212 
(1993)); growth factor receptors (L. Patthy, Cell 61: 13- 
14 (1990), and cytokine receptors (A. Miyajima et al . , 
TIBS 17: 378-382 (1992)). In general, the cytoplasmic 
domain of these proteins is critical for signaling. 
Thus, to understand signal transduction through such 
receptors, it is essential to understand the structure 
and function of the cytoplasmic domain. This is 
especially difficult for multisubunit Type I proteins. 

An additional complication in studying and in 
developing methods of controlling or modulating the 
activity of integrins and other transmembrane proteins is 
the inherent hydrophobic ity of membrane - spanning 
stretches of apolar amino acids. This means that 
structural studies on truncated receptors containing 
these helical regions will be impeded by insolubility in 
aqueous solution. 

Therefore, there is a need for improved methods of 
controlling and modulating the activity of integrins and 
other transmembrane proteins, of detecting their 
activity, and modulating their activity to detect and 
control physiological conditions. 



To meet these needs, we have developed model systems 
that allow study and control of the activity of the 
intracellular or cytoplasmic regions of integrins and 



SUMMARY 
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other transmembrane proteins. Particularly, these model 
systems maintain the cytoplasmic regions in their natural 
conformation. These model systems include protein 
constructs and chimeric integrins. 

One aspect of the present invention is protein 
constructs intended to link cytoplasmic regions of 
transmembrane proteins in a framework providing 
solubility in aqueous media. In general, a protein 
construct according to the present invention comprises: 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head-to-head fashion, each of the two amino acid 
sequences of the first segment including a substantially 
helical amphiphilic amino acid region; and 

(2) two second segments, each of the second 
segments joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment; the protein construct having: (i) either 
no free a-amino terminus or one free a-amino terminus 
derived from one of the two amino acid sequences of the 
first segment and (ii) two free carboxyl termini. 

One protein construct according to the present 
invention comprises : 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head- to-head fashion, each of the two amino acid 
sequences of the first segment including a substantially 
helical amphiphilic amino acid region; and 

( 2 ) two second segments , each of the second 
segments joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment. 

The protein construct has one free a-amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini. This is 
referred to as a Type I protein construct. 

Typically, the substantially helical amphiphilic 
acid regions of the first segment have a predominantly 
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periodic secondary structure. Preferably, the 

substantially helical amphiphilic amino acid regions in 
the first segments have an estimated helicity of at least 
about 80%; more preferably, the helicity is at least 
5 about 85%. 

Typically, the substantially helical amphiphilic 
amino acid regions each have the sequence G- (X 1 -L-X 2 -X 3 -L- 
X 4 -G) n , wherein X l is selected from the group consisting of 
lysine, arginine, and ornithine, X 2 and X 4 are each 

10 independently selected from the group consisting of 

aspartic acid and glutamic acid, X 3 is selected from the 
group consisting of alanine, serine, and threonine, and 
n is an integer from 2 to 20. Preferably, n is from 3 to 
6; more preferably, n is 4 . Preferably, X! is lysine, X 2 

15 and X 4 are each glutamic acid, and X 3 is alanine. 

Typically, n is identical for both of the 
substantially helical amino acid regions; preferably, the 
sequence of both of the substantially helical amphiphilic 
amino acid regions is identical. However, these are not 

2 0 absolute requirements, and constructs in which n is 

different for the substantially helical amino acid 
regions or in which the sequence of the substantially 
helical amino acid regions differs are within the scope 
of the invention. 

2 5 Typically, each of the two second segments has a 

length of about 15 amino acids to about 50 amino acids. 
The two second segments can be non-helical. Typically, 
the amino acid sequences of each of the two second 
segments are derived from the cytoplasmic domain of a 

3 0 transmembrane protein. More typically, the amino acid 

sequences of each of the two second segments are derived 
from the amino acid sequences of the cytoplasmic domains 
of the subunits of a heterodimeric multisubunit 
transmembrane protein where the subunits noncovalently 
3 5 associate in vivo. The amino acid sequences of each of 

the two second segments can be derived from the 
cytoplasmic domain of an integrin. In this case, 
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20 



25 



30 



preferably the amino acid sequence of one of the second 
segments is derived from the cytoplasmic domain of an 
integrin selected from the group consisting of ct lt a 2 , (* 3 , 
of 4 , a 5 , a 6 , ot lt a s , a llb , a v , a L , a Mt a x , and o? IEL# and the 
amino acid sequence of the second of the second segments 
is derived from the cytoplasmic domain of an integrin 
selected from the group consisting of (3 2 , /? 3 , (3 4 , (3 5 , 

(3 6f (3 7f and /3 8 , so that one of the following combinations 
is formed: ot { fi lt ot 2 p lt a 3 fi lt ot A $ Xt a 4 (3 lf a 5 (3 {f a 6 p x , a 6 0 4 , a 7 (3 x , 
a g (3 u a y p x , ot y (3 3 , ct v (3 5 , a y (3 6f a v ^ t a L (3 2 , ot M (3 2f a x (3 2 , a Ub 0 3f and 

A particularly preferred embodiment is one in which 
the amino acid sequence of one of the second segments is 
derived from the cytoplasmic domain of integrin a nb and 
the amino acid sequence of the second of the second 
segments is derived from the cytoplasmic domain of 
integrin /3 3 . In this embodiment, the amino acid sequence 
of the first of the second segments can be residues 98 9- 
1007 of integrin of IIb/ with an additional carboxyl- terminal 
glutamine, and the amino acid sequence of the second of 
the second segments can be that of residues 716-762 of 
integrin £ 3 . 

Preferably, the helicity of at least one of the 
first and second segments is increased in the construct 
over the helicity of the first or second segment alone. 

Typically, the two amino acid sequences of the first 
segment are linked by a thioether linkage between a 
cysteine residue at the amino terminus of one of the 
amino acid sequences of the first segment and a 
bromoacetyl moiety at the amino terminus of the second of 
the amino acid sequences of the first segment. Other 
linkages can also be used. 

In an alternative Type I protein construct according 
to the present invention, the two amino acid sequences 
are linked by an oxime linkage between an aldehyde moiety 
and an aminooxy moiety. This alternative Type I protein 
construct can comprise: 
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(1) 



a first segment including two amino acid 
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15 



20 



25 



30 



sequences joined at their amino- terminal residues in 
head- to-head fashion through an oxime linkage formed by- 
reaction of an aldehyde moiety with an aminooxy moiety, 
each of the two amino acid sequences of the first segment 
including a substantially helical amphiphilic amino acid 
region; and 



segments joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment. 

This alternative Type I protein construct has either zero 
or one free amino terminus derived from one of the two 
amino acid sequences of the first segment and two free 
carboxyl termini, with the proviso that when the protein 
construct has one free amino terminus, the amino acid 
residue having the aminooxy moiety has an amino group 
derived from a side chain, the free amino terminus of the 
protein construct being derived either from the a-amino 
terminus or from the amino terminus of the side chain. 

One of the second segments can have an amino acid 
sequence derived from the cytoplasmic domain of a subunit 
of an integrin with a deletion of a sequence G-F-F-K-R. 

A particularly preferred protein construct 
comprises: 

(1) a first segment including two copies of a 
substantially helical amphiphilic amino acid sequence 
that is G- (K-L-E-A-L-E-G) 4 , joined by a thioether linkage 
formed by reaction of a cysteine residue that is linked 
to the amino -terminal end of one of the two amino acid 
sequences and a bromoacetyl moiety that is linked to the 
amino- terminal end of the second amino acid sequence; and 

(2) two second segments, each covalently linked to 
a carboxyl terminus of the first segment, wherein one of 
the second segments has the amino acid sequence of 
residues 989-1007 of integrin ct Ub with an additional 
carboxyl -terminal glutamine residue, and the second 
second segment is residues 716-762 of integrin £ 3 . 



(2) 



two second segments, each of the second 
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Another embodiment of the invention is a protein 
construct referred to herein as a Type II protein 
construct, which comprises: 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head- to-head fashion so that the first segment has no 
free amino terminus and two carboxyl termini available 
for covalent linkage to the amino- terminal end of another 
amino acid segment; 

(2) two second segments, each including a 
substantially helical amphiphilic amino acid region, each 
of the second segments being covalently linked at their 
amino -terminal ends to a carboxyl terminus of the first 
segment ; and 

(3) two third segments, each of the third segments 
being an amino acid sequence, each of the third segments 
being covalently linked by their amino-termini to the 
carboxyl -termini of the second segment, the protein 
construct having no free amino terminus and two free 
carboxyl termini derived from the third segments. 

The second and third segments of this Type II 
protein construct are analogous to the first and second 
segments of the Type I protein construct, but the Type II 
protein construct has additional residues between the 
helical regions and the covalent linkage. The first 
segment can include a specific binding partner sequence 
having affinity for a specific binding partner. The 
specific binding partner sequence can specifically bind 
an antibody. 

Another aspect of the present invention is a method 
of producing protein constructs. For Type I protein 
constructs using a thioether linkage, this method can 
comprise the steps of : 

(1) providing two amino acid sequences, each amino 
acid sequence including: (i) 'an amino -terminal amino acid 
segment; (ii) an intermediately situated substantially 
helical amphiphilic amino acid segment covalently linked 
at its amino terminus to the first segment; and (iii) a 
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third amino acid segment covalently linked at its amino 
terminus to the second amino acid segment within the 
amino acid sequence; and 

(2) joining the two amino acid sequences in head- to- 
head fashion through a thioether linkage to form a 
protein construct having one free a-amino terminus 
derived from one of the two amino acid sequences and two 
free carboxyl termini . 

For Type I protein constructs using an alternative 
oxime linkage, the method can comprise the steps of: 

(1) providing two amino acid sequences, each amino 
acid sequence comprising a substantially helical 
amphiphilic amino acid region at its amino terminus 
covalently linked to a second amino acid sequence; and 

(2) joining the two amino acid sequences in head-to- 
head fashion through an oxime linkage between an amino 
acid residue with an aldehyde moiety and an amino acid 
residue with an aminooxy moiety to form a protein 
construct having either zero or one free amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini, with the 
proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the a-amino terminus or from 
the amino terminus of the side chain. 

For Type II protein constructs using a thioether 
linkage, the method can comprise the steps of: 

(1) providing two amino acid sequences, each amino 
acid sequence including: (i) an amino- terminal amino acid 
segment; (ii) an intermediately situated substantially 
helical amphiphilic amino acid segment covalently linked 
at its amino terminus to the first segment; and (iii) a 
third amino acid segment covalently linked at its amino 
terminus to the second amino acid segment within the 
amino acid sequence; and 

(2) joining the two amino acid sequences in head-to- 
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head fashion through an oxime linkage between an amino 
acid residue with an aldehyde moiety and an amino acid 
residue with an aminooxy moiety to form a protein 
construct having either zero or one free amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini, with the 
proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the a-amino terminus or from 
the amino terminus of the side chain. 

For alternative Type II protein constructs using an 
oxime linkage, the method can comprise the steps of: 

(1) providing two amino acid sequences, each amino 
acid sequence including: (i) an amino- terminal amino acid 
segment; (ii) an intermediately situated substantially 
helical amphiphilic amino acid segment covalently linked 
at its amino terminus to the first segment; and (iii) a 
third amino acid segment covalently linked at its amino 
terminus to the second amino acid segment within the 
amino acid sequence ; and 

(2) joining the two amino acid sequences in head- to- 
head fashion through an oxime linkage between an amino' 
acid residue with an aldehyde moiety and an amino acid 
residue with an aminooxy moiety to form a protein 
construct having either zero or one free amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini, with the 
proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the Qf-amino terminus or from 
the amino terminus of the side chain. 

Another aspect of the present invention is chimeric 
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integrin proteins and methods for their use. 

Chimeric integrin proteins according to the present 
invention can comprise: (1) a chimeric integrin protein 
comprising the extracellular and transmembrane domains of 
5 the Tac subunit of the IL-2 receptor covalently linked to 

the cytoplasmic domain of integrin /3 3 ; (2) a chimeric 
integrin protein comprising the extracellular and 
transmembrane domains of the Tac subunit of the IL-2 
receptor covalently linked to the cytoplasmic domain of 

10 integrin j8 3 with amino acid 752 being mutated from a 

serine residue to a proline residue; (3) a chimeric 
integrin protein comprising the transmembrane and 
extracellular domains of the Tac subunit of the human IL- 
2 receptor covalently linked to the cytoplasmic domain of 

15 integrin a IIb ; and (4) a heterodimeric chimeric integrin in 

which the extracellular and transmembrane domains of 
human integrin a IIb /? 3 are joined to the cytoplasmic domains 
of human integrin a 5 (5 x . 

Another aspect of the invention is nucleic acid 

20 sequences encoding the chimeric integrin proteins. Yet 

another aspect of the invention is these nucleic acid 
sequences operatively linked to at least one control 
element for transcription of the nucleic acid sequence, 
as well as vectors comprising the nucleic acid sequences 

25 operatively linked to the control elements, the vectors 

being capable of transfecting at least one eukaryotic 
host for expression of the chimeric integrin proteins 
encoded by the nucleic acid sequences. 

Another aspect of the invention is. a method for 

3 0 blocking the activation of a human cellular integrin 

comprising the step of expressing a chimeric integrin 
whose extracellular and transmembrane domains are derived 
from the Tac subunit of the human IL-2 receptor and whose 
cytoplasmic domain is the cytoplasmic domain of the human 

35 /?! integrin in a quantity sufficient to inhibit high 

affinity ligand binding by the cellular integrin. 
Alternatively, in this method, the chimeric integrin 
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protein comprising the extracellular and transmembrane 
domains of the Tac subunit of the IL-2 receptor 
covalently linked to the cytoplasmic domain of integrin 
0 3 can be used. 



These and other features, aspects, and advantages of 
the present invention will become better understood with 
reference to the following description, appended claims, 
and the accompanying drawings where : 

Figure 1A is a schematic drawing of the presumed 
topology of the native integrin a lIb /? 3 ; 

Figure IB is a schematic drawing of a Type I protein 
construct of the present invention designated MP-1; 

Figure 2 is a depiction of the amino acid sequences 
of the synthetic polypeptide targets, including a helix- 
dimer, MP-1, and MP-2, showing the relationships between 
the sequences in the assembled molecules; 

Figure 3 is a schematic equation showing the 
synthesis of MP-1 using chemical dovetailing; 

Figure 4 shows graphs of analytical HPLC results 
illustrating the effect of reaction conditions on 
formation of the thioether bond in the chemical 
dovetailing step as a function of time: (4(a), aqueous 
solution (0.1 M phosphate, pH 7); 4(b), 
dimethylf ormamide-0 . 1 M phosphate (95:5), pH 7) ; 

Figure 5 shows the results of studies intended to 
characterize the assembled polypeptides MP-1 (5(a)) and 
MP-2 (5(b)); main panel, ion-spray mass spectra showing 
charge state distribution; insets, analytical reverse 
phase HPLC spectra; 

Figure 6 shows the results of circular dichroism 
studies of synthetic polypeptides containing the 
individual cytoplasmic domains of the integrin a Ub j6 3 (6(a) , 
isolated cytoplasmic domains, 6(b), cytoplasmic domains 
attached to helical segments) ; 

Figure 7 shows the results of circular dichroism 
studies of MP-1, MP-2, and the covalent helix-dimer; 

Figure 8 shows the results of fluorescence quenching 
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studies on MP-1, MP-2, isolated /3 3 , and isolated (S 3 + 
isolated ar nb ; 

Figure 9 is a schematic depiction of various 
chimeric integrins and Tac chimeras, including the amino 
acid sequences of the cytoplasmic regions of the 
integrins; 

Figure 10 shows histograms depicting the results of 
flow cytometry in which fluorescence intensity is 
depicted on the abscissa and cell number on the ordinate; 
in Panels A, B , and C, PAC1 binding in the absence 
(filled histogram) or presence (open histogram) of 
competitive inhibitor is shown; in Panel D, surface 
expression of a IIb /3 3 is shown as reported by the binding of 
D57 antibody in the absence (filled histogram) or 
presence (open histogram) of co- transf ected Tac-/? 3 DNA; Ml 
indicates the region containing those cells that express 
the recombinant ot nb (3 3 construct bearing the cytoplasmic 
domains of (S { and a 5 ; 

Figure 11A is a graph showing the inhibition of 
inside-out signaling by Tac chimeras ( (■) (3 3 , (•) (3 lt (a) 
ct 5 ) used to transf ect CHO cells, with signaling assessed 
by PAC1 binding; 

Figure 11B is a graph showing the expression of the 
Tac chimeras used in Figure 11A assessed by the binding 
of the anti-TAC antibody 7G7B6; 

Figure 12A is a graph showing the effect of the 
mutation S 752 P on inside-out signaling in cells transfected 
with chimeras as in Figure 11A ((a) Tac-/8 3 (S 752 P) ; (■) Tac- 



Figure 12B is a graph showing the expression of the 
Tac chimeras used in Figure 11A assessed as in the 
experiments shown in Figure 11A and 11B; 

Figure 13A is a graph showing the effect of a Tac-jSj 
chimera on the affinity of a "hinge" mutant, a L A ( (•) a 5 ; 
(■) a L A) , by cotransf ect ion as in Figure 11A; and 

Figure 13B is a graph showing the expression of the 
Tac-/?! chimera in the experiments of Figure 13A assessed 
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as in the experiments shown in Figure 11A and 11B. 



In the context of this disclosure, including the 
description, examples, and claims, the following terms 
are defined as follows unless otherwise indicated: 

" Protein Construct " means a molecule constructed 
from two or more protein segments that are covalently 
joined in such a way that the molecule has one free of- 
amino terminus derived from one of the protein segments 
but two or more free carboxyl termini . 

"Nucleic Acid" includes both DNA and RNA unless 
otherwise indicated, and can include both single and 
double -stranded nucleic acid sequences. If a DNA 
sequence is referred to, references generally to both 
strands of a DNA sequence, either individually or as a 
Watson-Crick double helix. If only one strand is 
specified, the complementary strand whose antiparallel 
sequence is determined by Watson-Crick based pairing 
rules is also included unless the complementary sequence 
is specifically excluded. If only one strand is 
specified in double -stranded DNA, the strand specified as 
the sense strand whose strand would be equivalent to the 
sequence of any RNA transcribed from the double -stranded 
DNA, except for the replacement of thymidine (T) in the 
DNA by uridine (U) in the RNA. Reference to a nucleic 
acid sequence also includes modified bases as long as the 
modification does not significantly interfere with 
Watson-Crick base pairing or other specified functions of 
the nucleic acid, and can, for example, include 
substitution of uridine for thymidine in DNA as well as 
methylation of bases or modification of sugars. 

"Helical" refers to any periodic regular secondary 
structure of amino acids in a protein, domain of a 
protein, or peptide characterized by a uniform 
translation along an axis, and includes but is not 
limited to an cx-helix unless so specified. 

"Periodic" refers to any regular secondary structure 
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of amino acids in a protein that is not a random coil or 
a quasi-random coil, and includes, but is not limited to, 
a helical structure. 

"Specific binding partner" refers to a member of a 
pair of molecules that interact by means of specific non- 
covalent interactions that depend on the three- 
dimensional structures of the molecules involved, such as 
salt links, hydrophobic interactions, and van der Waals 
interactions. Typical pairs of specific binding partners 
inc lude ant igen - ant ibody , hapten - ant ibody , hormone - 
receptor, nucleic acid strand- complementary nucleic acid 
strand, substrate-enzyme, inhibitor-enzyme, carbohydrate- 
lectin, biotin-avidin, and virus-cellular receptor. 

"Antibody 11 includes both intact antibodies and 
antibody fragments possessing an antigen-binding site, as 
well as chimeric antibodies. 

"Amino terminus " refers to an amino group or moiety 
that is part of an amino acid residue within a protein, 
polypeptide, or protein construct. The amino group can 
either be an or-amino terminus or an amino terminus from 
a side chain, such as the e -amino group of lysine. 

"Amino- terminal residue" refers to an entire amino 
acid residue possessing an amino terminus. The amino- 
terminal residue can be joined to other amino acid 
residues, either through its carboxyl terminus by a 
peptide bond, or through a side chain. 
I . PROTEIN CONSTRUCTS 

One aspect of the present invention is protein 
constructs that maintain the cytoplasmic tails of 
transmembrane proteins in conformations approaching their 
naturally occurring conformations while preserving 
solubility in an aqueous medium and obviating the 
difficulty of working with proteins containing highly 
hydrophobic transmembrane segments. 

These protein constructs allow study of the 
conformation and activity of the cytoplasmic regions of 
transmembrane proteins, including integrins, and can be 
used, as discussed below, in screening and in controlling 



BNSDOCID: <WO 9534641 A1> 



WO 95/34641 




PCI7US95/07542 



-17- 



the activity of integrins and other transmembrane 
proteins . 

A. Structure of Protein Constructs 

In general, a protein construct according to the 
present invention comprises : 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head-to-head fashion, each of the two amino acid 
sequences of the first segment including a substantially 
helical amphiphilic amino acid region; and 

(2) two second segments, each of the second 
segments joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment; 

the protein construct having: (i) either no free a- amino 
terminus or one free a-amino terminus derived from one of 
the two amino acid sequences of the first segment and 
(ii) two free carboxyl termini. 

1 . Protein Construct With First and Second 
Segments 

One aspect of the present invention is a protein 
construct with first and second segments. In general, 
this protein construct comprises: 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head-to-head fashion, each of the two amino acid 
sequences of the first segment including a substantially 
helical amphiphilic amino acid region; and 

(2) two second segments, each of the second 
segmen ts joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment. The protein construct has one free a- 
amino terminus derived from one of the two amino acid 
sequences of the first segment and two free carboxyl 
termini. For convenience, this protein construct is 
referred to herein as a Type I protein construct. 

The free a-amino terminus derived from one of the 
two amino acid sequences of the first segment is free in 
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the sense that it is not involved in a peptide bond. 
However, unlike the situation in naturally-occurring 
proteins, the amino acid residue to which this a-amino 
terminus belongs is covalently linked to two other amino 
acid residues, although one of the linkages is through a 
side chain; the other is through the carboxyl terminus of 
that residue, through a peptide bond. This or-amino 
terminus derived from one of the amino acid sequences of 
the first segment must be distinguished from any amino 
terminus derived from the side chains of the amino acid 
sequence, typically e -amino termini derived from lysine 
side chains. 

Typically, the substantially helical amphiphilic 
acid regions of the first segment have a predominantly 
periodic secondary structure. More typically, the 
substantially helical amphiphilic amino acid regions in 
the first segment have an estimated helicity of at least 
about 80%. Preferably, the substantially helical 
amphiphilic amino acid regions in the first segment have 
an estimated helicity of at least about 85%. One 
suitable measurement technique for helicity is circular 
dichroism (CD) , but other measurement techniques such as 
optical rotatory dispersion (ORD) can be used. 

Typically, the substantially helical amphiphilic 
amino acid regions have the sequence is G- (X 1 -L-X 2 -X 3 -L-X 4 - 
G) n . In this repeating sequence, n is an integer from 2 
to 20, preferably from 3 to 6, more preferably 4. In the 
repeating sequence, X! is lysine, arginine or ornithine, 
X 2 and X 4 can each be glutamic acid or aspartic acid, and 
X 3 can be alanine, serine, or threonine. Typically, the 
sequence of the two substantially helical amphiphilic 
amino acid regions is the same, but this is not required. 
Typically, n is the same for both of the two 
substantially helical amphiphilic amino acid regions, but 
this is also not required. In some applications, it may 
be desirable to provide protein constructs in which the 
substantially helical amphiphilic regions are of 
different lengths . 
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Preferably, X { is lysine, X 2 and X 4 are each glutamic 
acid, and X 3 is alanine. Derivatives of these amino acids 
that do not .alter the relative hydrophobicity or 
hydrophilicity or disrupt the helix are also included 
within the scope of the invention. 

In one particularly preferred alternative of the 
invention, the helical amphiphilic amino acid regions 
form coiled-coil tertiary and quaternary structures (O.D. 
Monera et al . , "Comparison of Antiparallel and Parallel 
Two-Stranded a-Helical Coiled-Coils , " J. Biol. Chem. 268: 
19218-19227 (1993); N.E. Zhou et al . , J. Biol. Chem. 267: 
2664-2670 (1992) ) analogous to those of the prototypical 
coiled-coil protein tropomyosin. The stability of these 
coiled-coils is largely a result of strong interchain 
hydrophobic interactions between leucine residues in the 
seven residue repeat (S.Y.M. Lau et al . , J. Biol. Chem. 
259: 23253-13261 (1984)). These coiled-coil structures 
are likely to better mimic the proximity of transmembrane 
helices in the natural system and also ensure that a 
defined topology is maintained between the a and (3 
cytoplasmic tails. In other words, the coiled-coil can 
act as a structural template onto which the cytoplasmic 
domain of the integrin or other transmembrane protein is 
attached. This ensures that the two cytoplasmic tails 
are staggered with respect to one another in a manner 
that approximates the intact protein. 

Typically, each of the two second segments has a 
length of about 10 amino acids to about 80 amino acids; 
more typically, each of the two second segments has a 
length of about 15 amino acids to about 50 amino acids. 
The two second segments can be predominantly non-helical 
in structure. The amino acid sequences of each of the 
two second segments can be derived from the cytoplasmic 
domain of a transmembrane protein. Typically, the amino 
acid sequences of the two * second segments are derived 
from the amino acid sequences of the cytoplasmic domains 
of the subunits of a heterodimeric multisubunit 
transmembrane protein where the subunits noncovalently 
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associate in vivo, but it is also possible to prepare 
constructs according to the present invention in which 
the amino acid sequences of the second segments are 
derived from protein subunits that do not normally 
associate into a heterodimeric transmembrane protein in 
vi tro . 

In one embodiment, the amino acid segments of each 
of the two second segments are derived from the 
cytoplasmic domain of an integrin. The protein 

constructs of the present invention are particularly 
useful for mimicking integrin conformation because, in a 
heterodimeric integrin, the location of the 
transmembrane-cytoplasmic interface of both families of 
subunits is formed by conserved Trp-Lys or Tyr-Lys 
residues . These residues are proximal to each other at 
the interface, so that subunits of differing lengths have 
a predictable relative stagger. The cytoplasmic tails 
are connected to helical transmembrane stretches, which 
may be important for inducing structure along the tails, 
and both cytoplasmic tails have carboxyl termini. 

The following combinations of sequences from the or 
and (3 subunits of integrins can be used according to 
Hynes (1992) , supra, incorporated herein in its entirety 
by this reference: Subunit 0, can associate with any of 
ot lf Qf 2 , of 3 , of 4 , a s , a 6 , Q? 7 , a 8 , and 0f v - The amino acid 
sequence from (3 2 can be associated with the amino acid 
sequence from a L , or M and a x . The amino acid sequence from 
/? 3 can be associated with a IIb or a v . The amino acid 
sequence of subunit /? 4 can be associated with <* 6 . For 
subunits /3 S , (3 6 , and (3 S , the amino acid sequence can be 
associated with the amino acid sequence of a v . For (3 lf 
the amino acid sequence can be associated with the amino 
acid sequence for a 4 or a IEL . The term "associated" in 
this context means that a heterodimeric integrin in which 
the cytoplasmic domains of the sequences are as specified 
is likely to form in vivo. 

In one particularly preferred embodiment, the amino 
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acid sequence of one of the second segments is derived 
from the cytoplasmic domain of integrin a llb and the amino 
acid sequence of the other of the second segments is 
derived from the cytoplasmic domain of /3 3 . 
5 Preferably, the amino acid sequence of the first of 

the second segments is derived from residues 989-1008 of 
a nb and the amino acid sequence of the second of the 
second segments is that of residue 716-762 of jS 3 . The 
residue 1008, which is glutamic acid (E) in the natural 

10 integrin according to the DNA sequence, is preferably 

altered to glutamine (Q) in the protein construct, 
because it is believed that post- translational side-chain 
amidation of the carboxyl -terminal glutamate occurs (J.J. 
Calvete et al . , FEBS Lett. 263:43-46 (1990)). 

15 Although one particularly useful protein construct 

derives its second segments from sequences of integrins, 
the sequences of other transmembrane proteins can be 
used. These include the T-cell receptor, cytokine 
receptor, other CD antigens, or the growth hormone 

20 receptor. 

The two amino acid sequences of the first segment 
are joined at their amino- terminal residues in head- to- 
head fashion so that there is one free ar-amino terminus, 
as described above. Preferably, the first and second 

25 segments are joined through a thioether linkage. Other 

joining methods also are known in the art. 

In an alternative Type I protein construct according 
to the present invention, the first and second segments 
are joined through an oxime linkage formed between an 

30 aminooxy moiety on the amino- terminal residue of one of 

the two amino acid sequences of the first segment and an 
aldehyde residue on the amino- terminal residue of the 
other amino acid sequence of the- first segment. This 
protein construct comprises: 

35 (1) a first segment including two amino acid 

sequences joined at their amino- terminal residues in 
head- to-head fashion through an oxime linkage formed by 
reaction of an aldehyde moiety with an aminooxy moiety, 
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each of the two amino acid sequences of the first segment 
including a substantially helical amphiphilic amino acid 
region; and 



segments joined to the first segment at the carboxyl 
terminus of each of the two amino acid sequences of the 
first segment. 

The protein construct has either zero or one free amino 
terminus derived from one of the two amino acid sequences 
of the first segment and two free carboxyl termini, with 
the proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the a-amino terminus or from 
the amino terminus of the side chain. 

In this alternative Type I protein construct, it is 
not necessary that the linkage of the two amino acid 
sequences of the first segment be through the side chain 
of a cysteine residue. If a free a-amino terminus exists 
in this alternative Type I protein construct, it is 
available for reaction. 

Typically, in the constructs of the present 
invention, the helicity of at least one of the first and 
second segments is increased in the construct over the 
helicity of the first or second segment alone, as 
measured by circular dichroisrn. 

In another embodiment of the present invention, one 
of the second segments has an amino acid sequence derived 
from the a subunit of an integrin, such as a IIb , with a 
deletion of the sequence GFFKR (SEQ ID NO: 5) • This 
deletion causes an increase of the affinity of the 
integrin for its extracellular ligand and is sometimes 
referred to as a "hinge mutation." A schematic of a 
typical Type I protein construct of the present 
invention, using second segments whose amino acid 
sequences are derived from a Ub and /3 3 integrins, is shown 
in Figure 1 . 



(2) 



two second segments, each of the second 
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Although the protein constructs disclosed above 
contain two cytoplasmic domains of transmembrane 
proteins, related techniques of chemoselect ive ligation 
can be used to generate higher-order protein constructs 
containing three, four, or more cytoplasmic domains of 
transmembrane proteins held in a conformation 
approximating the conformation in vivo. This technique 
is known generally as template-assisted protein 
synthesis, and is described in P.E. Dawson & S.B.H. Kent, 
"Convenient Total Synthesis of a 4-Helix TASP Molecule by 
Chemoselective Ligation," J. Am, Chem. Soc . 115: 7263- 
7266 (1993) . 



Another embodiment of the present invention is a 
protein construct comprising: 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head-to-head fashion so that the first segment has one 
free a-amino terminus derived from one of the two amino 
acid sequences of the first segment and two carboxyl 
termini available for covalent linkage to the amino- 
terminal end of another amino acid segment; 

(2) two second segments, each including a 
substantially helical amphiphilic amino acid region, each 
of the second segments being covalently linked at their 
amino- terminal ends to a carboxyl terminus of the first 
segment; and 

(3) two third segments, each of the third segments 
being an amino acid sequence, each of the third segments 
being covalently linked by their amino- termini to the 
carboxyl -termini of the second segment. The protein 
construct has one free amino terminus derived from the 
first segment and two free carboxyl termini derived from 
the third segments. For convenience, this construct is 
referred to as a Type II protein construct herein. 

The second and third segments of the Type II protein 
construct are substantially equivalent to the first and 



2. 



Protein Construct With First, Second, and 
Third Segments 
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second segments of the Type I protein construct described 
above. However, the first segment includes additional 
amino acid residues. 

The first segment of the Type II construct can 
include a specific binding partner sequence having 
affinity for a specific binding partner. The specific 
binding partner sequence can specifically bind an 
antibody. 

An alternative Type II protein construct uses an 
oxime linkage between two amino acid sequences of the 
first segment as discussed above. This protein construct 
comprises : 

(1) a first segment including two amino acid 
sequences joined at their amino- terminal residues in 
head- to-head fashion through an oxime linkage formed by 
reaction of an aldehyde moiety with an aminooxy moiety so 
that the first segment has two carboxyl termini available 
for covalent linkage to the amino-terminal end of another 
amino acid segment; 

(2) two second segments, each including a 
substantially helical amphiphilic amino acid region, each 
of the second segments being covalently linked at their 
amino-terminal ends to a carboxyl terminus of the first 
segment ; and 

(3) two third segments, each of the third segments 
being an amino acid sequence, each of the third segments 
being covalently linked by their amino- termini to the 
carboxyl -termini of the second segment. 

This alternative Type II protein construct has 
either zero or one free amino terminus derived from one 
of the two amino acid sequences of the first segment and 
two free carboxyl termini, with the proviso that when the 
protein construct has one free amino terminus, the amino 
acid residue having the aminooxy moiety has an amino 
group derived from a side chain, the free amino terminus 
of the protein construct being derived either from the o;- 
amino terminus or from the amino terminus of the side 
chain. 
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B . Synthesis of Protein Constructs 

Typically, synthesis of a type I protein construct 
using a thioether linkage according to the present 
invention occurs by the following process: 

(1) providing two amino acid sequences, each amino 
acid sequence comprising a substantially helical 
amphiphilic amino acid region at its amino terminus 
covalently linked to a second amino acid sequence; and 

(2) covalently linking two amino acid sequences in 
head- to-head fashion through a thioether linkage to 
produce a protein construct having one free a-amino 
terminus derived from one of the two amino acid sequences 
and two free carboxyl termini. 

Typically, synthesis of an alternative Type I 
protein construct using an aminooxy linkage occurs by the 
following process : 

(1) providing two amino acid sequences, each amino 
acid sequence comprising a substantially helical 
amphiphilic amino acid region at its amino terminus 
covalently linked to a second amino acid sequence; and 

(2) joining the two amino acid sequences in head-to- 
head fashion through an oxime linkage between an amino 
acid residue with an aldehyde moiety and an amino acid 
residue with an aminooxy moiety to form a protein 
construct having either zero or one free amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini, with the 
proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the of-amino terminus or from 
the amino terminus of the side chain. 

Typically, synthesis of a Type II protein construct 
using a thioether linkage according to the present 
invention occurs by the following process: 

(1) providing two amino acid sequences, each amino 
acid sequence including: (i) an amino- terminal amino acid 
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segment; (ii) an intermediately situated substantially 
helical amphiphilic amino acid segment covalently linked 
at its amino terminus to the first segment; and (iii) a 
third amino acid segment covalently linked at its amino 
terminus to the second amino acid segment within the 
amino acid sequence; and 

(2) joining the two amino acid sequences in head-to- 
head fashion through a thioether linkage to form a 
protein construct having one free or-amino terminus 
derived from one of the two amino acid sequences and two 
free carboxyl termini. 

Typically, synthesis of an alternative Type II 
protein construct using an aminooxy linkage occurs by the 
following process : 

(1) providing two amino acid sequences, each amino 
acid sequence including: (i) an amino- terminal amino acid 
segment; (ii) an intermediately situated substantially 
helical amphiphilic amino acid segment covalently linked 
at its amino terminus to the first segment; and (iii) a 
third amino acid segment covalently linked at its amino 
terminus to the second amino acid segment within the 
amino acid sequence; and 

(2) joining the two amino acid sequences in head- to- 
head fashion through an oxime linkage between an amino 
acid residue with an aldehyde moiety and an amino acid 
residue with an aminooxy moiety to form a protein 
construct having either zero or one free amino terminus 
derived from one of the two amino acid sequences of the 
first segment and two free carboxyl termini, with the 
proviso that when the protein construct has one free 
amino terminus, the amino acid residue having the 
aminooxy moiety has an amino group derived from a side 
chain, the free amino terminus of the protein construct 
being derived either from the a-amino terminus or from 
the amino terminus of the side chain. 

The amino acid sequences comprising the first and 
second segments or the first, second, and third segments 
are generally synthesized through peptide synthesis. The 
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principles of peptide synthesis, particularly solid-phase 
peptide synthesis, are well known in the art and are 
described, for example, in M. Bodanszky, "Principles of 
Peptide Synthesis" (2d ed., Springer, Berlin, 1993). 
Typically, synthesis is performed on a solid phase, and 
involves activation and coupling, the reversible blocking 
of amino and carboxyl groups, and eventual deprotection . 
A particularly useful method of solid phase protein 
synthesis is described in M. Schnolzer, "In Situ 
Neutralization in Boc-Chemistry Solid Phase Peptide 
Synthesis," Int. J. Peptide Protein Res. 40:180-193 
(1992), incorporated herein by this reference. This 
method uses a t -butyloxycarbonyl protecting group. 

Alternatively, the amino acid sequences needed for 
assembly of the constructs can be produced through 
recombinant expression techniques that yield the desired 
products. These techniques are well known in the art and 
need not be described further here. 

The synthesized segments are then covalently linked 
in head-to-head fashion. Preferably, a free cysteine 
residue on one of the amino acid sequences is condensed 
with a bromoacetyl moiety on the other amino acid 
sequence to yield a protein construct with a thioether 
linkage. Such techniques are described, for example in, 
T.W. Muir & S.B.H. Kent, "The Chemical Synthesis of 
Proteins." Curr . Qpin. Biotechnology 4 : 420-427 (1993) and 
in M . Schnolzer & S.B.H. Kent, Science 256:221-225 
(1992), both of which are incorporated herein by this 
reference . 

C. Applications 

The protein constructs of the present invention have 
a number of applications based on the ability to maintain 
the cytoplasmic tails of the construct in a configuration 
that is equivalent or similar to the configuration 
predominating in vivo while maintaining solubility and 
stability in an aqueous system, namely in staggered, 
parallel, and proximal topology. For example, these 
protein constructs can. be used to detect intracellular 
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molecules capable of binding to integrins and modulating 
signals by inside-out signaling. Alternatively, these 
molecules can be used in vivo to disrupt or modulate 
inside-out signaling by binding to the cells in a manner 
such that the cytoplasmic domains of these protein 
constructs compete for intracellular molecules with the 
natural integrins. Because these protein constructs do 
not contain the extracellular ligand-binding sites of 
integrins, they would then disrupt inside-out signaling. 
This would be particularly useful in conditions in which 
overactivity of integrins is ' involved, such as 
inflammation, thrombosis, and malignancy. 

Additionally, protein constructs according to the 
present invention could be used to detect molecules 
capable of binding to the intracellular or cytoplasmic 
domain of integrins and other transmembrane molecules in 
vivo. The protein constructs could be modified with 
aldehyde or aminooxy groups and cross-linked to affinity 
columns for chromatography. Affinity chromatography is 
well known in the art and need not be described here; 
typical procedures are described, for example in G.T. 
Hermanson et al., "Immobilized Affinity Ligand 
Techniques" (Academic Press, San Diego, 1992) , 
incorporated herein by this reference. Other derivatized 
constructs can be made with various derivatives for 
affinity chromatography, according to crosslinking 
reactions well known in the art. Constructs according to 
the present invention can also be used in screening for 
oncogens . 

II . CHIMERIC TRANSMEMBRANE PROTEINS 

Another approach to the structure and function of 
transmembrane proteins, particularly integrins, is the 
use of chimeric transmembrane proteins in which the 
cytoplasmic region from one transmembrane protein is 
covalently fused to the extracellular and transmembrane 
region of another transmembrane protein. Unlike the 
protein constructs discussed above, this fusion is 
accomplished by genetic engineering techniques, and each 
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of the two subunits of the heterodimeric chimeric protein 
is synthesized by a normal protein synthesis mechanism in 
which the chain is synthesized as one unit from the amino 
terminus to the carboxy terminus . 
5 In these chimeric transmembrane proteins, the 

cytoplasmic domain of an integrin can be fused to the 
transmembrane and extracellular domains of a non- integrin 
transmembrane protein, such as the Tac subunit of the 
human IL-2 receptor. Alternatively, the cytoplasmic 

10 domain of one integrin can be fused to the transmembrane 

and extracellular domains of another integrin. 

Examples include a chimeric integrin protein 
comprising the extracellular and transmembrane domains of 
the Tac subunit of the IL-2 receptor covalently linked to 

15 the cytoplasmic domain of integrin /3 3 , a chimeric integrin 

comprising the transmembrane and extracellular domains of 
the Tac subunit of the IL-2 receptor and the cytoplasmic 
domain of integrin a llb . 

In the chimeric integrin in which the cytoplasmic 

20 domain is that of integrin (S 3 , the amino acid in the 

cytoplasmic domain of integrin (3 3 that is amino acid 752 
in the complete integrin 0 3 molecule can be mutated from 
a serine residue to a proline residue. This mutation is 
associated with defective inside-out signalling in intact 

25 integrins. 

Another aspect of the present invention is a 
heterodimeric chimeric integrin in which the 
extracellular and transmembrane domains of human integrin 
a IIb /? 3 are joined to the cytoplasmic domains of human 

3 0 integrin a 5 (3 y , so that the a. subunit has the extracellular 

and transmembrane domains of a Ub and the cytoplasmic 
domain of a 5 , and the (3 subunit has the extracellular and 
transmembrane domains of j8 3 and the cytoplasmic domain of 

Pi. 

3 5 Chimeric integrins can be constructed in a number of 

vectors well known in the art, such as the CDM8 vector 
(A. Aruffo & B. Seed, Proc . Natl. Acad. Sci . USA 84:8573- 
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8577 (1987)), or can be constructed in the CMV-IL2R 
vector for production of chimeric integrins in which the 
extracellular and transmembrane domains are derived from 
Tac (S.E. LaFlamme et al . , J. Cell Biol. 117:437-447 
(1992) ) . Alternatively, other vectors known in the art 
can be used. 

General cloning techniques are well known in the art 
and need not be described further here; they are 
described, for example, in J. Sambrook et al . , "Molecular 
Cloning: A Laboratory Manual" (Cold Harbor Laboratory 
Press, Cold Spring Harbor, New York, 1989), vol, 1-3; B. 
Perbal, "A Practical Guide to Molecular Cloning" (John 
Wiley & Sons, New York, 1988); D.V. Goeddel, ed. , "Gene 
Expression Technology" (Methods in Enzymology , vol. 185, 
Academic Press, Inc., San Diego, 1991) and other sources. 
In general, methods for selecting the vector, choosing 
the appropriate cleavage points for restriction 
endonucleases, ligating the appropriately cleaved or 
synthesized segment into the vector, introducing the 
chimeric vector containing the DNA segment to be 
expressed into a suitable host cell by techniques such as 
transf ection, lipofection, and other techniques, and 
expressing the desired protein are well known in the art 
and described in these references. Therefore, they need 
not be elaborated upon further here. 

Mutations can be introduced into the chimeric 
integrins by site-specific mutagenic techniques, such as 
those involving the polymerase chain reaction technique 
(J.C. Loftus et al. , Science 249:915-918 (1990) ) , as well 
as other site-specific mutagenic techniques described in 
the cloning references given above. 

The present invention also encompasses nucleic acid 
sequences encoding chimeric integrins as described above, 
and nucleic acid sequences operatively linked to at least 
one control element for transcription of the nucleic acid 
sequence. Suitable control elements are well known in 
the art and include promoters and enhancers. These are 
described in the references cited above, namely Maniotis 
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et al . , Perbal, and Goeddel . Promoters include the Rous 
sarcoma virus LTR promoter, the SV4 0 early promoter, the 
herpes simplex thymidine kinase promoter, the mouse 
mammary tumor virus LTR promoter, the adenovirus major 
late promoter, the murine metallothionein promoter, the 
avian sarcoma virus promoter, the human cytomegalovirus 
immediate early promoter, the interferon /? promoter, the 
Drosophila heat shock protein 70 promoter and the hMTII 
promoter. Other promoters and enhancer elements are well 
known in the art and need not be described further here . 

A further aspect of the invention is vectors 
comprising the nucleic acid sequence coding for a 
chimeric integrin operatively linked to at least one 
control element. Vectors according to the present 
invention are capable of transfecting at least one 
eukaryotic host for expression of the chimeric integrin 
encoded by the nucleic acid sequence. Vectors are well 
known in the art and need not be described further here; 
they are described in the references cited above. 
Further examples of vectors are given in the examples. 

These chimeric integrins are capable of blocking 
activation of human integrins. A method for blocking the 
activation of a human integrin can comprise the step of 
expressing a chimeric integrin whose extracellular and 
transmembrane domains are derived from the Tac subunit of 
the human IL-2 receptor and whose cytoplasmic domain is 
the cytoplasmic domain of the human (3 { integrin in a 
quantity sufficient to inhibit high affinity ligand 
binding by the cellular integrin. This is a competitive 
method. The preparation of the chimeric integrin whose 
extracellular and transmembrane domains are derived from 
the Tac subunit and whose cytoplasmic domain is the 
cytoplasmic domain of the human 0, integrin is described 
in Example 4 . 

As another aspect of the present invention, a 
similar method can be used for blocking the activation of 
a human integrin comprising the step of expressing a 
chimeric integrin comprising the extracellular 
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transmembrane domains of the Tac subunit covalently 
linked to the cytoplasmic domain of integrin /? 3 . 

These techniques block inside -out signal 
transduction and represents an alternative way of 
5 modulating integrin activity other than using small 

molecular weight competitive inhibitors of ligand binding 
to the extracellular domain (M.H. Ginsberg et al . , J\ 
Biol. Chem. 260:3931-3 936 (1985); M.J. Humphries et al . 
(1986), supra). The method described herein involves 

10 inhibition of binding to the intracellular domain. Since 

integrin activation involves cell -type specific factors, 
such inhibition could be cell type-specific. This 
provides a totally new method for controlling integrin 
behavior in disease states such as thrombosis, 

15 inflammation, and tumor invasion and metastasis. 

The present invention is illustrated by the 
following Examples. The Examples are for illustrative 
purposes only and are not intended to limit the 
invention. 

20 EXAMPLES 

Example 1 

Synthesis of Type I Protein Construct and 
Other Model Peptides 
The Type I protein construct shown schematically in 
25 Figure IB and in Figure 2 was synthesized, along with the 

following molecules used as controls: (1) a molecule 
that consists of the cytoplasmic domain of integrin a IIb 
with the carboxyl terminal amino acid changed from 
glutamic acid to glutamine to reflect likely in vivo 
3 0 amidation and the cytoplasmic domain of integrin (S 2 

covalently joined through a thioether linkage, but 
without the helical segments (MP- 2 in Figure 2) ; (2) a 
molecule consisting of a helix dimer covalently joined 
through a thioether linkage without the cytoplasmic 
3 5 domains (Helix-Dimer in Figure 2) . The Type I protein 

construct is identified in Figure 2 and in other figures 
as MP-1 as an alternative designation. 

Also, the cytoplasmic tails of a IIb and j3 3 were 
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